AI-Based Syntactic Pattern Recognition of Sequences
نویسندگان
چکیده
This patent concerns the traditional problem encountered in the syntactic Pattern Recognition (PR) of strings or sequences. The primary investigator involved in this work is a Full Professor at Carleton University in Ottawa, Canada, and is a Fellow of the IEEE. The primary problem solved by the invention involves determining the string or sequence that is most similar to a sequence presented to the system. The search could be initiated by presenting, to the system, a noisy or inexact version of a string contained in memory for example, at a web-site or in the library or database. The invention will yield the closest string/sequence by searching the dictionary of possible words using a newly invented AIbased strategy. The core of this invention is this search strategy, called the Clustered Beam Search. Experiments have been done to show the benefits of the CBS over the current state-ofthe-art, and the results demonstrate an unbelievably marked improvement (sometimes as high as 90%) for large libraries and databases. The solution provided by the invention would be applicable in numerous areas including : Inexact or proximity searching on the Internet, keyword-based search in libraries and databases, spelling correction, speech and character recognition (including optical character recognition), and the processing of biological sequences, for example, in human genome projects. These applications are briefly described below. ∗This author can be addressed at: School of Computer Science, Carleton University, Ottawa, Canada : K1S 5B6. e-mail address : [email protected]. †Professor and Fellow of the IEEE. This author can be contacted at: School of Computer Science, Carleton University, Ottawa, Canada : K1S 5B6. e-mail address : [email protected]. More information about this inventor, who holds a Doctorate from Purdue University, can be found at www.scs.carleton.ca/∼oommen.
منابع مشابه
AI and Pattern Recognition
This panel deals with the relationships between artificial intelligence and pattern recognition. It addresses such issues as: How do the fields differ? How should they relate to one another? In particular, does AI have anything to learn from PR? Pattern recognition, classically, involves the extraction of features and their subsequent analysis in feature space using statistical tools or cluster...
متن کاملSteel Consumption Forecasting Using Nonlinear Pattern Recognition Model Based on Self-Organizing Maps
Steel consumption is a critical factor affecting pricing decisions and a key element to achieve sustainable industrial development. Forecasting future trends of steel consumption based on analysis of nonlinear patterns using artificial intelligence (AI) techniques is the main purpose of this paper. Because there are several features affecting target variable which make the analysis of relations...
متن کاملSyntactic Pattern Recognition Systemfor Dna
We review both theoretical and practical results of a linguistic approach to studying the structure of features of DNA sequences. Using generative grammars, complex assemblages can not only be described and analyzed abstractly, but also concretely, such that features can be searched for by a general-purpose parser. Our parser, called GenLang, uses an extended logic grammar formalism and has fou...
متن کاملIntegrating AI with Sequence Analysis
This chapter will discuss one example of how AI techniques are being integrated with, and extending, existing molecular biology sequence analysis methods. AI ideas of complex representations, pattern recognition, search, and machine learning have been applied to the task of inferring and recognizing structural patterns associated with molecular function. We wish to construct such patterns, and ...
متن کاملA Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features
Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performanc...
متن کامل